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Orton-Gillingham (OG) and Orton-Gillingham-based reading instructional programs are commonly 
implemented reading programs in the United States. Twelve studies that employed quasi-experimental 
or experimental designs are reviewed. These studies included elementary students, adolescents, and 
college students. Of the 12 studies, 5 reported that the OG instruction was more effective than were 
comparison or control interventions for all measured outcomes, 4 reported that the OG instruction was 
more effective for at least 1 (but not all) outcomes in comparison to other intervention(s), 2 reported 
that the alternate instruction was more effective than the OG instruction, and 1 reported no significant 
differences once covariates were included. The largest effects were reported for word attack and non- 
word reading outcomes, with mean effect size of .82, and comprehension outcomes, with a mean effect 
size of .76. Following a summary of each study, limitations of the current review and implications for 
future research and practice are described. Given the small number of studies, the lack of method- 
ological rigor of the existing studies, and the inconclusive findings of the effectiveness of OG pro- 
grams, additional research is needed before the scientific basis can be established. 


In the early part of the 20th century, the field of learning dis- 
abilities was still in its infancy. At that time, Dr. Samuel Orton, 
a child neurologist, estimated that approximately 10% of the 
school population had reading disabilities and speculated that 
reading problems were caused by neurological deficits. Orton’s 
early conclusions formed two major tenets of the characteristics 
of dyslexia: (a) reading problems were caused by strephosym- 
bolia (or twisted symbols), and (b) a lack of brain hemisphere 
dominance caused information to be processed in both hemi- 
spheres, resulting in mirror images (e.g., reversals of the let- 
ters b and d). 

Although Orton’s speculated causes of reading disabili- 
ties have not stood the test of time (phonological processing 
problems are currently accepted as the primary cause of read- 
ing disabilities), his ideas about appropriate reading instruction 
have. Specifically, Orton (1937) stated that the instructional 
approach should 

attempt to capitalize [sic] their [students’] auditory 
competence by teaching them the phonetic equiva- 
lents of the printed letters and the process of blend- 
ing sequences of such equivalents so that they might 
be able to produce for themselves the spoken form 
of the word from its graphic counterpart, (p. 159) 


The instructional approach conceived by Orton was de- 
veloped into a curriculum by Anna Gillingham and Bessie 
Stillman and first described in the manual Remedial Training 
for Children with Specific Disability in Reading, Spelling, and 
Penmanship (1960). Currently in its eighth edition (Gilling- 
ham & Stillman, 1997), the manual remains the backbone of 
Orton-Gillingham (OG) instructional programs and other in- 
structional programs derived from the original curriculum. 

Characteristics of OG-Based Instruction 

The OG approach is a systematic, sequential, multisensory, 
synthetic and phonics-based approach to teaching reading. 
Explicit instruction is provided in phonology and phono- 
logical awareness, sound-symbol correspondence, syllables, 
morphology, syntax, and semantics. A key characteristic of 
OG reading instruction is that it is multisensory, involving vi- 
sual, auditory, and kinesthetic/tactile learning pathways, often 
referred to as the Language Triangle. Instruction directly 
teaches language components, is provided systematically and 
cumulatively, requires mastery and overlearning before stu- 
dents advance to new components, is based on ongoing diag- 
nostic information and assessment, and is individualized to 
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the needs of each student (Clark & Uhry, 1995). As a final 
point, OG instruction is to be provided by trained and quali- 
fied teachers, tutors, or other specialists. 

Related reading programs have evolved from the origi- 
nal teaching methodology. Several authors worked directly with 
Orton, Gillingham, or Stillman to make their adaptations. Other 
adaptations were developed based on specific needs, such as 
modifying the approach to address whole class instruction or 
to meet the needs of adult learners. Well-known adaptations 
and extensions include Alphabetic Phonics (Cox, 1992), Wil- 
son Reading System (Wilson, 1996), The Herman Method 
(Herman, 1993), Project ASSIST (Biasotto, 1993), The Sling- 
erland Approach (Slingerland & Aho, 1994-1996), The Spal- 
ding Method (Spalding & Spalding, 1990), Starting Over 
(Knight, 1995), and Project Read (Enfield & Greene, 1997). 
Although this is not an exhaustive list, the programs listed 
above include materials, training, and research or evaluation. 
Across curricula, the underlying instructional principles are 
consistent with the original OG methodology, although the 
targeted age group, instructional setting, materials, or other 
differences may exist. 

Need for Evidence From 
Scientifically Based Research 

Since their development, OG and OG-based reading instruc- 
tion have been commonly accepted and frequently delivered 
interventions for students with reading disabilities. Although 
the standard of identifying and using the best instructional 
practices is customary to special education, it appears that the 
widespread use of OG instruction has been fueled by anecdotal 
evidence and personal experience. It is possible, for exam- 
ple, that in comparison to general education reading instruc- 
tion practices, other commonly used basal reading programs, 
and/or teachers’ “eclectic” attempts at addressing reading dif- 
ficulties, the relative effectiveness of OG experienced by prac- 
titioners is overwhelmingly positive. As with any intervention, 
however, adequate scientifically based evidence of the effec- 
tiveness of OG and OG-based programs should be sought be- 
fore such instruction is provided for students. 

Two current legal mandates require a scientific evidence 
base for reading instruction. First, No Child Left Behind 
(NCLB; 2002) legislation, which includes the Reading First 
initiatives, requires that schools employ “a learning system or 
program of reading instruction based on scientifically-based 
reading research” (§ 1202 [c] [7] [A] ). New federal requirements 
define scientifically-based reading research as research that 
“applies rigorous, systematic and objective procedures to ob- 
tain valid knowledge relevant to reading development, reading 
instruction, and reading difficulties” (No Child Left Behind, 
2002, Part B, Subpart 1, Section 1208(6)(A)). Second, OG or 
OG-based reading instruction often is specifically requested 
by parents and professionals for students receiving special 
education services. Bhat, Rapport, and Griffin (2000) reviewed 


27 legal decisions based on due process cases in which parents 
requested specific reading methods. In 15 of the 27 cases (de- 
cided between 1989 and 1998), parents requested OG or a 
specific OG-based program (e.g.. The Slingerland Approach, 
1994-1996, or the Wilson Reading System, 1996). In an ad- 
ditional 4 cases, a language-based and/or multisensory pro- 
gram was requested. 

Given the number of children who receive OG and OG- 
based reading instruction, as well as the current legal climate, 
it is important to establish whether there is, at this time, suf- 
ficient evidence to suggest that OG and OG-based reading 
instruction programs meet the requirements of scientifically- 
based reading instruction. The purpose of this paper is to re- 
view the literature on OG and OG-based reading instruction 
programs in comparison to other instructional approaches. 
Following a summary of research studies, methodological is- 
sues that confound understanding of the effectiveness of OG 
and OG-based programs will be described. Finally, implica- 
tions for further research and instruction are discussed. 

Method 

To locate articles to include in this review, we searched the 
ERIC, ECER, and Psychlnfo databases using the following key 
words: Orton Gillingham, multisensory, reading, decoding, 
and phonics, as well as the titles and authors of all OG-based 
curricula. We also searched the ProQuest dissertation data- 
base using the same terms and beginning with 1980. The ref- 
erence lists of identified articles were examined for additional 
references. Finally, a hand search was conducted of the fol- 
lowing journals: Annals of Dyslexia (titled The Bulletin of the 
Orton Society until 1981), Journal of Learning Disabilities , 
Learning Disabilities Quarterly, Exceptional Children, and 
Journal of Special Education, beginning with the earliest avail- 
able issue through 2005. 

Studies were selected for inclusion if they: (a) were pub- 
lished in a peer-refereed journal or were a doctoral disserta- 
tion, (b) studied a multisensory reading approach based on OG 
methods or philosophy, (c) were conducted using experimen- 
tal, quasi-experimental, or single-subject research designs, and 
(d) included a sample size of at least 10 participants per experi- 
mental condition. Descriptive studies, pilot studies, program 
evaluations, and studies employing only a pretest-posttest 
design (without a control or comparison treatment) were ex- 
cluded (e.g., Hutcheson, Selig, & Young, 1990; Ogden, Hind- 
man, & Turner, 1989; Vickery, Reynolds, & Cochran, 1987). 
Three additional studies that applied OG principles to foreign 
language teaching were excluded (Sparks & Ganschow, 1993; 
Sparks, Ganschow, Kenneweg, & Miller, 1991; Sparks, Gan- 
schow, Pohlman, Skinner, & Artzer, 1992). A corpus of 12 stud- 
ies met the criteria for inclusion in this review. 

All studies were coded for pertinent information, includ- 
ing participant characteristics, setting, experimental design, 
treatment conditions, comparison and/or control conditions, 
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dependent variables, and results. Each author coded each study, 
and interrater agreement was calculated using the following 
formula: Number of Agreements + Number of Agreements + 
Disagreements. Interrater agreement across categories ex- 
ceeded 99%. 

Information for calculating effect sizes also was coded 
from each study. To determine effect sizes {d, Cohen, 1988), 
we identified and then used a spreadsheet to record the mean, 
the standard deviations, and the number of participants for 
treatment and control/comparison groups. We calculated ef- 
fect sizes using the following formula: Mean of OG Treatment 
Group - Mean of Comparison/Control Group + Pooled SD. 
When sufficient descriptive statistics were unavailable, effect 
sizes were estimated from F or t statistics, using the formula 
provided by Rosenthal (1991). When there were more than 
two treatment groups, effect sizes were calculated for both 
treatment groups. Interrater agreement for effect sizes ex- 
ceeded 99%. Effect sizes can be interpreted as d = .2 as a small 
effect, d = .5 as a medium effect, and d = .8 as a large effect 
(Cohen, 1988). 

Results 

Twelve studies (ten articles and two dissertations) reporting 
the results of experimental or quasi-experimental research 
studies met the inclusion criteria. Table 1 provides descriptive 
information for each study, including (a) OG or OG-based in- 
structional program, (b) comparison or control treatment(s), 
(c) experimental design, (d) participants, (e) setting, (f) out- 
come measures, (g) results, and (h) effect sizes. The results of 
the reviewed studies are presented by first describing each 
study and then summarizing the results across outcome mea- 
sures and overall findings. 

Elementary Students 

Studies that investigated the effectiveness of OG approaches 
with elementary students varied in terms of how the approach 
was applied. In some cases, the OG approach was the primary 
method of reading instruction in general education classrooms. 
In others, the approach was used as an intervention method 
for students who were considered to be at risk for reading fail- 
ure or as a secondary intervention for students with reading 
disabilities. The results of studies conducted with elementary 
students are organized according to instructional setting; first, 
the results of school-based interventions are reported, fol- 
lowed by clinic-based intervention studies. 

School-Based Interventions. Stoner (1991) investigated 
the effectiveness of Project Read (Enfield & Greene, 1997) in 
a quasi-experimental design that compared Project Read to 
traditional basal reading instruction for first-, second-, and 
third-graders who were considered at risk for reading prob- 
lems. Teachers implemented instruction in general education 


classrooms, and the progress of students was compared to stu- 
dents who had received traditional basal instruction the pre- 
vious year (previous cohort). For first-grade children, there were 
significant differences on the Stanford Achievement Test in 
word study (d = 1.15), in word reading (d = 1.06), in com- 
prehension ( d = .93), and in total reading score (d = 1.15). All 
differences favored the Project Read intervention. For second- 
and third-grade children, there were no significant differences 
on any of the outcome measures. An additional analysis ex- 
amined the results for children in classrooms where the 
teacher taught in the control condition the previous year and 
Project Read in the second year. We do not describe those re- 
sults here, as the analysis duplicates the finding of the entire 
sample. 

Litcher and Roberge (1979) investigated OG instruction 
as an early intervention program for first-grade students iden- 
tified as at risk for reading problems by screening measures 
given to approximately 600 students in the school district. 
Twenty students were identified for each of 3 years and as- 
signed to experimental interventions by matched pairs. Each 
year, students were taught OG reading and language instruc- 
tion 3 hours a day and were compared to a group of students 
receiving traditional basal instruction in matched schools. At 
the end of first grade, students who received the OG instruc- 
tion performed significantly higher than did the comparison 
group on the subtest and total scores of the Gates MacGinitie 
Reading Test (MacGinitie & MacGinitie, 1989) and the Met- 
ropolitan Achievement Test. Effect sizes for each year ranged 
from .75 to 2.88, indicating that the OG program produced 
greater performance (calculated from raw scores) than did tra- 
ditional basal instruction for readers at risk for reading prob- 
lems. 

Foorman et al. (1997) also investigated the effects of Cox’s 
(1992) Alphabetic Phonics (a synthetic phonics instructional 
program) for second- and third-grade students with reading 
disabilities in a quasi-experimental design. Two other instruc- 
tional methods were used as comparison treatments: a sight 
word program (Edmark Reading Program, 1984) and an ana- 
lytic phonics program (a modification of Recipe for Reading; 
Traub, & Bloom, 1992) that taught word identification at the 
onset-rime level). Students received one of the three types of 
reading instruction for 60 min per day across the school year 
and were assessed four times during the intervention. Growth 
curve analysis was used to model growth in phonological pro- 
cessing, orthographic processing, and word reading and to test 
for treatment effects. For phonological processing, the syn- 
thetic phonics instruction (OG) group significantly outper- 
formed the analytic phonics instruction group (d = .37) and 
sight word instruction groups (d = .59) when controlling for age. 
When other covariates (gender, socioeconomic status, ethnic- 
ity, and verbal IQ) were controlled for, the synthetic phonics 
(OG) groups and analytic phonics groups no longer differed, 
although the synthetic group outperformed the sight word 
instruction group. Verbal IQ was a significant correlate. For or- 
thographic processing, there were effects for age (older students 
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& Lindamood, 1979); MIRC = Multisensory integrated reading and composition; OG = Orton Gillingham reading instruction; NTRT = Nelson Denney Reading Test; SAT = Stanford Achievement Test; SDRT = Stanford Diagnostic Reading Test (Karlsen 
& Gardner, 1985); TOPA = Test of Phonological Awareness (Torgesen & Bryant, 1994); TORC = Test of Reading Comprehension (Brown, Hammill, & Wiederholt, 1978); TOWL = Test of Written Language (Hammill & Larsen, 1988); WRMT = Wood- 
cock Reading Mastery Test (Woodcock, 1973, 1989, or 1998); WRAT = Wide Range Achievement Test (Jastak & Wilkinson, 1984); WJ-R = Woodcock Johnson Psycho-Educational Battery-Revised (Woodcock & Johnson, 1989). 
a /V indicates number of participants after accounting for attrition. ^Indicates an unpublished dissertation. 
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had higher end-of-year performance and rates of growth than 
younger students did) and the synthetic phonics (OG) group 
had higher end-of-year scores than the analytic phonics group 
had ( d = .29), but not the sight word group ( d = .05). When 
the other covariates were controlled for, there were no signif- 
icant differences between the three instructional conditions, 
and ethnicity was a significant correlate. 

A similar pattern was found for word reading. Older stu- 
dents outperformed younger students, and the synthetic phon- 
ics (OG) instruction yielded higher word reading than analytic 
phonics instruction and sight word instruction did ( d = .39 and 
.17, respectively), but those effects were no longer significant 
when covariates were controlled for; socioeconomic status 
was the significant correlate in this case. Finally, the effects of 
the three instructional conditions on word reading were re- 
examined while controlling for students’ initial proficiency 
with phonological and orthographic processing. There were 
no longer significant treatment effects once initial skills were 
included in analysis. However, students who began the school 
year with higher phonological processing skills developed 
word reading skills at a higher rate than did students with 
lower initial skills (significant slope interaction). In sum, with- 
out controlling for covariates, the synthetic phonics (OG) in- 
struction led to (a) greater gains in phonological processing and 
word reading than did both other instructional conditions and 
(b) greater gains in orthographic processing than did the ana- 
lytic phonics instruction. However, when demographic vari- 
ables, verbal IQ, and initial skills were examined as covariates, 
there were no significant differences between the three in- 
structional methods. These findings suggest that examination 
of covariates is an important consideration in analyzing the 
effects of different types of reading instruction and that the 
effectiveness of one instructional program over another may 
be influenced by student characteristics. 

Joshi, Dahlgren, and Boulware-Gooden (2002) imple- 
mented the Language Basics: Elementary curriculum (as cited 
in Joshi et al., 2002) (based on Cox’s 1992 Alphabetic Phonics) 
in first-grade general education classrooms. Two classes were 
taught using Language Basics, and their progress was com- 
pared to two first-grade classrooms taught using the Houghton 
Mifflin reading program in a quasi-experimental design. At 
the end of first grade, both groups demonstrated significant 
growth in comprehension. Only the OG-based group made 
significant growth in word attack and phonological awareness. 
The Language Basics groups had significantly higher scores 
on word attack ( d = 1.08), phonological awareness (d = .44), 
and comprehension ( d = .81). 

In the final elementary school-based study, Westrich- 
Bond (1993) used a quasi-experimental design to examine the 
effects of OG instruction as compared to basal reading instruc- 
tion using the Ginn basal series. The participants were stu- 
dents with learning disabilities who received special education 
services in either resource room classrooms or self-contained 
special education classrooms. Students in both classroom types 
received either the OG reading instruction or basal reading 


instruction for a total of four conditions (resource room with 
OG reading, resource room with basal reading, self-contained 
with OG reading, and self-contained with basal reading). 
Reading instruction occurred during four sessions per week. 
The Woodcock Reading Mastery Test (Woodcock, 1989) Word 
Identification and Word Attack subtests were used as the mea- 
sures of real word and non-word reading. There were signif- 
icant gains scores for word attack and word identification 
from pretest to posttest (after controlling for initial reading 
skill), but there were no significant differences between the 
two instructional conditions (OG vs. basal reading instruction). 
There were, however, differences in gain scores for the type 
of special education classrooms. Gains in word attack scores 
were greater for students in self-contained classrooms than for 
students in resource room classrooms, while gains in word 
reading scores were greater for students in self-contained 
classrooms than for students in resource room classrooms. 
However, there were no significant differences between the 
type of reading instruction for real word reading (for resource 
room d = -.40, and for self-contained special education class- 
room d = -1.40) or word attack (for resource room d = .07, 
and for self-contained special education classroom d = .03) 
after controlling for initial word reading skill. Additional an- 
alysis was also conducted that controlled for the number of 
months of instruction, but there were still no significant dif- 
ference between the two types of reading instruction (OG vs. 
basal reading). 

Clinic-Based Interventions. Oakland, Black, Stanford, 
Nussbaum, and Balise (1998) investigated an OG-based re- 
mediation program known as the Dyslexia Training Program 
(DTP), developed at the Texas Scottish Rite Hospital. Based 
on Alphabetic Phonics (Cox, 1992), DTP offers both teacher- 
directed instruction and a video-directed instructional pro- 
gram to be used by teachers who have not had extensive 
training in the program. Using a quasi-experimental design, 
Oakland et al. compared the use of teacher-directed instruc- 
tion and video-directed instruction to a control group that par- 
ticipated in the resource room remedial program. Students 
(mean age =11 years, SD = 2) received instruction 1 hour per 
day, 5 days a week for 2 years. Students receiving DTP (re- 
gardless of whether it was the teacher-directed or the video- 
directed program) significantly outperformed the students in 
the control treatment on measures of comprehension (d = .65), 
word reading (d = .73), spelling (d = .24) and decoding mono- 
syllabic words (d = .45) and polysyllabic words (d = .80). Stu- 
dents in the DTP group outperformed their counterparts in the 
control group despite lower initial skills in word reading. 

The final elementary-school-age study was conducted 
by Hook, Macaruso, and Jones (2001), who compared the ef- 
ficacy of Fast ForWord (a computer-based instructional pro- 
gram designed to improve auditory processing skills) to OG 
instruction. In this summer program, students (7-12 years old) 
either received OG reading instruction or participated in Fast 
ForWord activities. Matched pairs of students (matched by 
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IQ, phonemic awareness ability, reading level, and age) re- 
ceived one of the intervention treatments. Pre- and posttest as- 
sessments included phonemic awareness, word identification, 
and word attack (additional oral language and phonological 
processing measures also were collected for the Fast ForWord 
treatment and a longitudinal control group, but are not dis- 
cussed here). At the end of the summer program, both groups 
made significant growth in phonemic awareness (assessed on 
the Lindamood Auditory Conceptualization Test, Lindamood 
& Lindamood, 1979). For reading, the OG group made sig- 
nificant growth in Word Attack, whereas the Fast ForWord 
groups did not. Neither group made significant growth in 
Word Identification. Effect sizes comparing posttest differ- 
ences indicated small effects for phonemic awareness ( d = 
.11) and word identification (d = .07), but large effects for Word 
Attack (d = .77). 

Middle School and Secondary Students 

Two studies investigated the effectiveness of OG with adoles- 
cents. Simpson, Swanson, and Kunkel ( 1992) investigated read- 
ing remediation for students (aged 13-18 years old) placed in 
juvenile detention facilities. In a quasi-experimental design, 
students who received 90 min per day (5 days per week) of 
OG-based instruction were compared to a comparison group 
of students who received 45 min per day of traditional read- 
ing instruction in a second detention facility. Of the 1 16 students 
who began the study, 63 students were available for posttest- 
ing. The OG instruction resulted in more reading growth — as 
assessed by the Woodcock Reading Mastery Test (Woodcock, 
1989) — than did the remedial English condition. Because of 
large differences in the amount of instructional time (both 
among students and between experimental conditions), ad- 
ditional analysis was conducted to predict the rate of reading 
growth relative to hours of reading instruction. In this case, 
students who received OG instruction, on average, outper- 
formed the comparison condition. Analysis of individual cases 
indicated that not all students made growth. The authors con- 
cluded that students were likely to make more growth if they 
received more hours of instruction. However, given the spe- 
cific circumstances of the participating students, not all stu- 
dents were at the detention facility long enough to receive an 
adequate amount of instruction to make growth. 

Dooley (1994) compared the effects of reading instruc- 
tion that incorporated multisensory reading instructional prin- 
ciples and cooperative learning. Using a quasi-experimental 
design, multisensory integrated reading instruction (MIRC) 
was compared to traditional instruction for middle school stu- 
dents in remedial reading classes. Students participated in in- 
struction for one semester, and reading and writing outcomes 
were assessed. For all measures except one (general vocabu- 
lary, d = .01), the MIRC group statistically significantly out- 
performed the control condition (word attack d = .46, reading 
rate d = .83, syntactic similarities d = .44, paragraph reading 
d = .42, sentence sequencing d = .5 1). The intervention in this 


study also included specific instruction in writing, and stu- 
dents demonstrated statistically significant improvements in 
writing. Students who participated in MIRC had significantly 
higher performance on thematic maturity (d = 1.15) and con- 
textual style (d = .79). 

College Students 

Three studies investigated the effectiveness of OG and OG- 
based instructional programs for college students. Only one 
study reported sufficient statistics to calculate effect sizes. 

For two studies, the participants were college students 
enrolled in Marshall University’s HELP (Higher Education 
for Learning Problems) programs. Guyer and Sabatino (1989) 
reported positive results for a multisensory phonics program 
for students with dyslexia enrolled in a 5-week summer pro- 
gram. Participants received either phonetic reading instruc- 
tion using OG instruction, nonphonetic reading instruction 
using a basic reading skills curriculum, or were part of a non- 
intervention comparison group. At the end of the summer 
program, there were significant differences among groups in 
performance on the Wide Range Achievement Test (WRAT; 
Jastak & Wilkinson, 1984) Reading subtest and Woodcock 
Reading Mastery Test ( WRMT; Woodcock, 1973) Total Read- 
ing score. The authors report that there was differential re- 
sponse to instruction and that the OG groups made greater 
amounts of progress than did groups with the nonphonetic ap- 
proach or those in the no-intervention control group. Although 
there was a relatively small sample size ( N = 30), the partic- 
ipants demonstrated gains over a relatively short period of 
time. 

In the second college-level study, Guyer, Banks, and 
Guyer (1993) investigated the effectiveness of the Wilson 
Reading System (Wilson, 1996) for improving spelling skills. 
College students with dyslexia who requested reading and 
spelling intervention were randomly assigned to two instruc- 
tional conditions and compared to a non-intervention control. 
The two intervention groups were provided with a semester 
(two 1-hour sessions per week for 16 weeks) of spelling in- 
struction using the Wilson program or instruction in a non- 
phonetic (whole word) approach to spelling. Using the WRAT 
(Jastak & Wilkinson, 1984) Spelling subtest as the outcome 
measure, students who received spelling instruction under the 
Wilson program significantly improved their spelling perfor- 
mance, whereas the nonphonetic approach and no-intervention 
groups did not demonstrate gains. 

In the third college-level study. Chandler, Munday, Tun- 
nell, and Windham (1993) compared OG instruction using Al- 
phabet Phonics (Cox, 1992) to a traditional remedial reading 
instructional approach, which focused on comprehension skills, 
reading efficiency, study skills, and test-taking strategies in a 
quasi-experimental design. Community college students par- 
ticipated in a course for one of two semesters. Chandler et al. 
reported findings in contrast to those of Guyer et al. (1993), 
specifically, that the students in the control group, who re- 
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ceived traditional reading instruction, performed significantly 
higher than did the students in the OG group. OG instruction 
did improve the reading performance of college students in 
developmental reading courses; however, a traditional devel- 
opmental course using a textbook and focusing on vocabulary 
and comprehension development was more effective in in- 
creasing overall reading levels, as measured by the Nelson 
Denney Reading Test (d = -1.18). 

Across the studies that targeted college students, one im- 
portant finding was that adults with reading disabilities may 
benefit from continued reading instruction. Guyer et al. (1993) 
and Guyer and Sabatino (1989) identified OG instruction as 
more effective than other approaches in improving reading 
and spelling skills, whereas Chandler et al. (1993) reported 
that the OG instruction was less effective than a traditional re- 
medial reading course typically implemented by community 
colleges. All three studies were conducted with a small num- 
ber of participants, used a limited number of measures to as- 
sess effectiveness, and had other methodological flaws that 
could limit the strength of this evidence. 

Summary 

The findings of this literature review are summarized in two 
ways. First, we summarize the results of the effect sizes find- 
ings across studies. This allows us to draw conclusions about 
which reading and reading-related skills are affected by OG 
and OG-based instructional programs. Second, we summarize 
the overall outcomes of each study. This provides a more com- 
prehensive picture of how much support exists for the efficacy 
of OG and OG-based instruction. 

Effect Sizes. We coded all studies for the type of read- 
ing or reading-related outcome and then classified outcomes 
into the following categories: word identification (reading real 
words), word attack or nonword reading (reading phonetically 
regular nonwords), comprehension, vocabulary, fluency, spell- 
ing, total reading, phonological/phonemic awareness, and or- 
thographic awareness. We then calculated the mean effect size 
for each outcome. (Multiple effects sizes for a single outcome 
within a single study were averaged prior to being aggregated 
across studies.) 

The greatest effect, on average, for the 12 studies was for 
word attack or nonword reading, with a mean effect of 0.82, 
which is considered a large effect (7 studies). Effect sizes 
ranged from 0.02 to 1.94. For word identification, the mean 
effect size was 0.42, with effect sizes ranging from -.091 to 
1.56 (6 studies). The mean effect size for total reading was 
0.37, ranging from -1.18 to 1.59 (4 studies). The mean effect 
size for comprehension was 0.76, ranging from 0.17 to 1.68 
(5 studies). For phonological awareness, the mean effect size 
was 0.34, ranging from .11 to 0.48 (3 studies). Two studies 
measured vocabulary (M d = 0.8 1 , ranging from 0.01 to 1 .60), 
and effect sizes for spelling (d = 0.24) could be calculated 
from 1 study. Effect sizes for fluency could be calculated from 


1 study (d = 0.83), and orthographic awareness was examined 
in 1 study (Foorman et al., 1997; d = .29 to .05 for the two 
comparison conditions). The largest mean effect for OG and 
OG-based reading program was for word attack/nonword 
reading and comprehension outcomes, with a small to medium 
effect for real word reading. Other mean effect sizes are in the 
small to medium range. Mean effect sizes should be inter- 
preted with some caution given the wide range of effect sizes 
within each outcome category and the use of different mea- 
sures of each outcome. 

Comparison Results. Twelve studies that compared 
OG or OG-based instructional procedures to another reading 
approach were reviewed. Five studies (Guyer et al., 1993; 
Guyer & Sabatino, 1989; Joshi et al., 2002; Litcher & Roberge, 
1979; Simpson et al., 1992) reported that OG instruction was 
more effective than one or both comparison approaches for 
all outcomes examined in the study. Four additional studies 
(Dooley, 1994; Flooket al., 2001; Oakland et al., 1998; Stoner, 
1991) reported that OG instruction was more effective than 
one or both of the comparison approaches for at least one, but 
not all, outcomes. Two studies reported that the alternate in- 
structional condition was more effective (Chandler et al., 1993; 
Westrich-Bond, 1993). Finally, Foorman et al. (1997) reported 
that relative effectiveness of OG instruction depended on the 
examination of covariates, and initial findings of the superi- 
ority of OG instruction to analytic phonics instruction and/or 
sight word reading instruction was reduced when covariates 
were controlled for. 

Discussion 

The purpose of this paper was to review the empirical research 
on OG and OG-based reading interventions. Twelve studies 
( 10 published articles and 2 dissertations) provided what could 
be best described as a preliminary evidence base of the ef- 
fectiveness of OG and OG-based programs in comparison to 
other instructional approaches. A synthesis of the effective- 
ness of OG and OG-based reading instruction, the need for 
methodological rigor in the study of such programs, and im- 
plications for future research and practice are discussed in the 
following sections. 

Effectiveness of OG 
and OG-Based Programs 

The central purpose of this literature review was to summa- 
rize the scientific research to determine the effectiveness of 
OG and OG-based instructional programs. Studies that cited 
positive outcomes for OG and OG-based instruction reported 
effects on various outcomes. Positive results were found for 
word reading (Guyer & Sabatino, 1989; Flook et al., 2001; 
Litcher & Roberge, 1979; Stoner, 1991), word attack/decoding 
(Flook et al., 2001; Joshi et al., 2002; Litcher & Roberge, 1979; 
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Stoner, 1991), spelling (Guyer et al., 1993; Oakland et al., 
1998), and comprehension (Joshi et al., 2002; Litcher & Ro- 
berge, 1979; Oakland et al., 1998; Stoner, 1991). Positive out- 
comes for OG and OG-based instruction were reported across 
settings and populations. For beginning readers, positive effects 
were reported for first-grade children in general education 
classrooms, for elementary children at risk for or identified 
with reading disabilities and served in public schools, and 
elementary-age children in clinical settings. Positive effects 
were also found in two studies of college students with diag- 
nosed learning disabilities (Guyer et al., 1993; Guyer & Saba- 
tino, 1989).The findings were not, however, all positive in 
favor of OG instructional programs. Nor were the findings sta- 
tistically significant favoring either instructional program. For 
example. Chandler et al. (1993) reported that the alternate in- 
structional condition was more effective than OG instruction. 
In addition, there were nonsignificant results for all partici- 
pants reported by Westrich-Bond (1993), for participants in 
second and third grade reported by Stoner (1991), and for spe- 
cific outcomes reported by Oakland et al. (1998) and by Hook 
et al. (2001). 

We find it interesting that effects on vocabulary were re- 
ported in only two studies and that only one study examined 
reading fluency as an outcome of instruction. Since being 
identified by the National Reading Panel (2000) report as two 
of five essential components for reading, these two areas are 
currently receiving increased research attention. It is impor- 
tant to note that the primary components of OG instruction 
typically do not explicitly address these areas of instruction, 
although several programs do include opportunities for in- 
struction or practice and/or suggestions in these areas. For OG 
programs to demonstrate their effectiveness for all aspects of 
reading, perhaps they will need to be expanded or supple- 
mented with other reading instruction approaches that either 
build vocabulary and fluency or more systematically study 
these specific outcomes. 

Despite the wide acceptance and enthusiasm for OG 
and OG-based programs, not all studies reported them to be 
superior, and caution should be taken when attempting to gen- 
eralize any of the reviewed results. Differences in study par- 
ticipants, setting, location, program type, instruction time, the 
OG instructional programs and implementation, and outcome 
measures must be considered when evaluating this research. 

Need for Rigorous Scientifically Based 
Research of OG 

The process of reviewing and summarizing OG and OG-based 
reading instruction research was challenging due to the small 
number of extant studies that employed experimental or quasi- 
experimental designs. Several studies were eliminated from 
this review because only pretest and posttest data were re- 
ported, because the report was described as a pilot study, or 
because the study included a small number of participants (i.e., 
n < 10). Given the popularity of OG and OG-based instruc- 


tional methodologies, we were surprised that only a dozen 
studies met the inclusion criteria of our review. We used a 
rather liberal inclusion criterion to allow for examination of 
the majority of extant studies. If we were to require a more 
stringent criterion, the scant number of studies would provide 
little information from which to draw any conclusions. 

Additionally, methodological issues present in many of 
the reviewed studies complicate any conclusions that could be 
drawn. All but 1 of the 12 studies used a quasi-experimental 
design, in which groups or classrooms of students (instead 
of individual students) were assigned to experimental con- 
ditions. Seven out of 12 studies had small sample sizes (total 
samples n < 50, see Table 1). Several studies were published 
in the 1970s and 1980s, when standards for educational re- 
search were less stringent and when technology for statistical 
analysis was not readily available to researchers. Reports of 
these older studies did not include components that are con- 
sidered compulsory in any high-quality research report today 
(Gersten et al., 2005). Several of the more recent articles also 
did not report such information as details regarding the pro- 
cedures used to ensure comparability or equivalency of treat- 
ment groups in quasi-experimental designs, fidelity with which 
instruction was implemented, technical characteristics of the 
measures used as dependent variables, or scope of training 
provided to instructors. The absence of such information makes 
it impossible to determine the internal validity of the research 
and suggests that, at this time, there is insufficient evidence 
to conclude that OG and OG-based reading instruction meet 
the requirements of scientifically-based reading instruction. 
The methodological rigor of future OG research must be im- 
proved or strong conclusions regarding its effectiveness can- 
not be drawn. 

This review of literature also raises areas for future re- 
search. The majority of the studies simply identify whether 
OG instruction is more effective than the comparison treat- 
ments). Given the number of variations in OG and OG-based 
programs, research examining the relative effectiveness of 
these programs seems warranted. Research that examines in- 
structional setting (e.g., individual, small-group, whole class 
instruction), as well as the effectiveness of different compo- 
nents of OG (e.g., phonology and phonological awareness, 
sound-symbol correspondence, and syllabication), would help 
to specify the conditions under which OG instruction is ef- 
fective. Finally, it is important to identify for whom OG in- 
struction is most effective and the types of responses that can 
be expected for students with different instructional needs. 

Limitations 

A limitation of the current review was the exclusion of re- 
search that was not published in peer-refereed journals or a 
doctoral dissertations. Conference presentations, book chap- 
ters, and research available through other venues (i.e., Web 
sites, publisher information, and other reports) were not in- 
cluded, but may provide additional support for the efficacy of 
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OG and OG-based reading instruction. However, peer review 
and publication in refereed journals is a necessary condition 
to determine the methodological rigor of research studies. The 
studies reviewed here have met at least the minimum require- 
ments of the journal in which they were published or had been 
conducted with the expert consultation of a dissertation com- 
mittee. 

Summary and Conclusions 

In sum, the extant research literature provides both evidence 
that supports, as well as evidence that fails to support, the ef- 
fectiveness of OG instruction in reading, when compared to 
other reading instruction. This review also highlights the dis- 
parity between research and practice. For decades, educational 
researchers have lamented the “research to practice gap” — 
that is, the reluctance of classroom teachers to implement ed- 
ucational practices that have been developed and validated by 
researchers (see. Gersten, Chard, & Baker, 2000; Greenwood 
& Abbott, 2001). In the case of OG instruction, the opposite 
phenomenon — a “practice to research gap” — appears to exist. 
Despite widespread use by teachers in a variety of settings for 
more than 5 decades, OG instruction has yet to be compre- 
hensively studied and reported in peer-refereed journals. The 
small number of existing studies lack methodological rigor 
that would be required for publication in current peer-refereed 
journals. 

The major conclusion of the review is that the research 
is currently inadequate, both in number of studies and in the 
quality of the research methodology, to support that OG in- 
terventions are scientifically based. Given the inconclusive and 
mixed nature of the extant research findings, it may be pre- 
mature to reconsider the implementation and use of OG read- 
ing instruction programs for children with reading disabilities. 
OG instructional programs continue to be implemented by 
teachers who find it an effective reading instruction program 
in a variety of settings. However, it is necessary to scientifi- 
cally determine (a) if OG and OG-based instructional inter- 
ventions are effective for all students with reading disabilities, 
(b) for which children OG reading instruction is most effec- 
tive, (c) the effectiveness of OG-based reading programs in 
comparison to other remedial approaches, and (d) the specific 
conditions under which OG is most effective. Only then can 
the scientific basis of such reading instruction be assured. 
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